Abstract

We study geometric properties of coordinate projections. Among
other results, we show that if a body $K \subset \mathbb{R}^n$ has an
"almost extremal" volume ratio, then it has a projection of proportional
dimension which is close to the cube. We also establish a sharp estimate
on the shattering dimension of the convex hull of a class of functions
in terms of the shattering dimension of the class itself.



1 Introduction 

In this article we present several results on coordinate projections. The
majority of this article is devoted to new applications of the entropy
inequality established in [MV], which, roughly speaking, states that if a
set of functions has a large entropy in $L_2$, it must have a coordinate
projection which contains a large cube.

Definition 1.1 We say that a subset $\sigma$ of $\Omega$ is $t$-shattered
by a class of real-valued functions $F$ if there exists a level function
$h$ on $\sigma$ such that, given any subset $\sigma'$ of $\sigma$, one can
find a function $f \in F$ with $f(x) \le h(x) - t$ if $x \in \sigma'$ and
$f(x) \ge h(x) + t$ if $x \in \sigma \setminus \sigma'$.

The shattering dimension of $F$, denoted by $\mathrm{vc}(F,\Omega,t)$ after
Vapnik and Chervonenkis, is the maximal cardinality of a subset of $\Omega$
which is $t$-shattered by $F$. In cases where the underlying space is clear
we denote the shattering dimension by $\mathrm{vc}(F,t)$.
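For a finite class on a finite domain, the definition can be checked by brute force. The following is a minimal sketch, not part of the original argument: functions are represented as tuples of values, and the names `is_t_shattered`, `vc` and the tolerance `tol` are illustrative assumptions. It relies on the observation that if some level function works, then each level $h(x)$ can be lowered to a value of the form $f(x) + t$ with $f \in F$, so only finitely many levels need to be tried.

```python
from itertools import combinations, product


def is_t_shattered(F, sigma, t, tol=1e-12):
    """Return True if the index tuple sigma is t-shattered by F.

    F is a list of functions, each given as a tuple of values on {0,...,n-1}.
    """
    # Candidate levels: any valid level h(x) can be lowered to f(x) + t
    # for some f in F, so it is enough to search over these values.
    candidates = [sorted({f[x] + t for f in F}) for x in sigma]
    for h in product(*candidates):
        shattered = True
        # pattern[j] is True when sigma[j] belongs to the subset sigma'
        for pattern in product([True, False], repeat=len(sigma)):
            witness_found = any(
                all(
                    (f[x] <= hx - t + tol) if low else (f[x] >= hx + t - tol)
                    for x, hx, low in zip(sigma, h, pattern)
                )
                for f in F
            )
            if not witness_found:
                shattered = False
                break
        if shattered:
            return True
    return False


def vc(F, n, t):
    """Brute-force shattering dimension of F on the domain {0,...,n-1}."""
    best = 0
    for k in range(1, n + 1):
        if any(is_t_shattered(F, s, t) for s in combinations(range(n), k)):
            best = k
    return best


# The vertices of the cube [-1,1]^2, viewed as functions on two points.
cube_vertices = list(product([-1.0, 1.0], repeat=2))
# The unit vectors of R^3, viewed as functions on three points.
unit_vectors = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
```

The search is exponential in both the size of the class and the size of the subset, so it is only meant for very small examples such as the two classes defined at the end of the block.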
Theorem 1.2 Let $F$ be a class of functions bounded by 1, defined on a
set $\Omega$. Then for every probability measure $\mu$ on $\Omega$,

$$N(F, t, L_2(\mu)) \le (2/t)^{K \cdot \mathrm{vc}(F, ct)}, \qquad 0 < t < 1, \tag{1.1}$$

where $K$ and $c$ are positive absolute constants.

Every $F \subset \mathbb{R}^n$ can be identified with a class of functions
on $\{1,\dots,n\}$ in the natural way: $v(i) = v_i$ for $v \in F$. If we
take $\mu$ to be the probability counting measure on $\{1,\dots,n\}$ then
(1.1) states that for any $0 < t < 1$,

$$N(F, t\sqrt{n}B_2^n) \le (2/t)^{K \cdot \mathrm{vc}(F, ct)}.$$

Note that if $F$ happens to be convex and symmetric with respect to the
origin, then $\mathrm{vc}(F,t)$ is the maximal cardinality of a subset
$\sigma$ of $\{1,\dots,n\}$ such that $P_\sigma(F) \supset [-t,t]^\sigma$.

We apply this result to study convex bodies whose volume ratio is almost
maximal. Recall that the volume ratio, introduced by Szarek and
Tomczak-Jaegermann [S, ST], is defined as
$\mathrm{vr}(D) = (|D|/|\mathcal{E}|)^{1/n}$, where $\mathcal{E}$ is the
ellipsoid of maximal volume contained in $D$.

The minimal volume ratio of a symmetric convex body in $\mathbb{R}^n$ is 1
and is attained by the Euclidean ball; the maximal is of the order of
$\sqrt{n}$ and is attained by the cube [B]. This pair of extremal bodies is
unique up to a linear transformation. Indeed, the uniqueness of the
minimizer is immediate, while the fact that the cube is the unique
maximizer was established in [Ba].

The isomorphic version of this fact - describing the bodies whose volume
ratio is of order either 1 or $\sqrt{n}$ - is of particular interest. The
question is whether such bodies inherit any structure from the Euclidean
ball or, respectively, from the cube.

If the volume ratio of a body $D$ in $\mathbb{R}^n$ is bounded by a
constant, then by the Volume Ratio Theorem [ST], $D$ has a section of
dimension proportional to $n$, which is well isomorphic to the Euclidean
ball.

On the other hand, if $\mathrm{vr}(D)$ is of order $\sqrt{n}$, then by [R]
and [V], $D$ has a section of dimension proportional to $\sqrt{n}$, which
is well isomorphic to the cube, and the order of $\sqrt{n}$ in the
dimension cannot be improved (the dual of Gluskin's polytope is such an
example - see Section 2). However, we show in Section 2 that there exists a
projection of $D$ of dimension proportional to $n$, which is well
isomorphic to the cube.




Two other applications we present are based on the following corollary
of Theorem 1.2.

Theorem 1.3 [MV] Let $F$ be a class of functions bounded by 1, defined
on a finite set $I$ of cardinality $n$. Then the gaussian process indexed
by $F$, $X_f = \sum_{i=1}^n g_i f(i)$, satisfies

$$E = \mathbb{E}\sup_{f \in F} X_f \le K\sqrt{n}\int_{cE/n}^{1} \sqrt{\mathrm{vc}(F,t) \cdot \log(2/t)}\, dt,$$

where $K$ and $c$ are absolute constants.

One application we present is a comparison of the average
$\mathbb{E}\|\sum_{i=1}^n \varepsilon_i x_i\|$ to the minimum over all
choices of signs, $\min_{\eta_i = \pm 1}\|\sum_{i=1}^n \eta_i x_i\|$. As a
consequence, we compare the type 2 and the infratype 2 constants of a
Banach space.

Then, we establish a sharp estimate on the shattering dimension of a
convex hull of a class of functions, based on the shattering dimension of
the class itself. Namely, we show that for every $\varepsilon > 0$,

$$\mathrm{vc}(\mathrm{conv}(F),\varepsilon) \le (C/\varepsilon)^2 \cdot \mathrm{vc}(F, c\varepsilon),$$

where $c$ and $C$ are absolute constants.

The final question we address is when a random coordinate projection is
an "almost isometry". Let $(\Omega,\mu)$ be a probability space and
$f \in L_2(\mu)$, and for simplicity, assume that $\Omega = \{1,\dots,n\}$
and that $\mu$ is the uniform probability measure on $\Omega$. For every
$\varepsilon > 0$, our aim is to find "many" sets
$\sigma \subset \{1,\dots,n\}$ of small cardinality such that the natural
coordinate projection $P_\sigma f$ satisfies

$$(1-\varepsilon)\|f\|_{L_2^n} \le \|P_\sigma f\|_{L_2^\sigma} \le (1+\varepsilon)\|f\|_{L_2^n}, \tag{1.2}$$

where $L_2^k$ is the $L_2$ space defined on $\{1,\dots,k\}$ with respect to
the uniform probability measure.

By a standard concentration argument, if $\|f\|_\infty \le 1$, then with
high probability a random coordinate projection of dimension
$C/\varepsilon^2$ is an almost isometry in the sense of (1.2). We will show
that the uniform boundedness of $f$ can be relaxed; it suffices to assume
that $\|f\|_{\psi_2} \le 1$, where $\|\cdot\|_{\psi_2}$ is the Orlicz norm
generated by the function $e^{t^2} - 1$. In this case, a random coordinate
projection of dimension $C/\varepsilon^2$ will be an almost isometry as
in (1.2) with high probability. Although this result is relatively easy, we
decided to present it because it gives hope that the conditions in stronger
concentration inequalities (e.g. Talagrand's concentration inequality for
empirical processes [T 94], [E]) can also be relaxed. As an application, we
obtain a coordinate version of the Johnson-Lindenstrauss "Flattening"
Lemma [JL].

Finally, we turn to some notational conventions. Throughout, all absolute
constants are denoted by $c$, $C$, $k$ and $K$. Their values may change
from line to line or even within the same line. We write $a \sim b$ if
there are absolute constants $c$ and $C$ such that $cb \le a \le Cb$.
2 Extremal volume ratios

The volume ratio of a convex body $D$ in $\mathbb{R}^n$ is defined as

$$\mathrm{vr}(D) = \inf \Big(\frac{|D|}{|\mathcal{E}|}\Big)^{1/n},$$

where $|\cdot|$ denotes the volume in $\mathbb{R}^n$, and the infimum is
over all ellipsoids $\mathcal{E}$ contained in $D$. This important
invariant was introduced by Szarek and Tomczak-Jaegermann (see [S], [ST]
or [F]).

The bodies with extremal volume ratios are the Euclidean ball and the
cube - and these are the only extreme bodies up to a linear transformation
(for the uniqueness of the cube, see [Ba]). One can show that for every
convex symmetric body $D$ in $\mathbb{R}^n$,

$$1 = \mathrm{vr}(B_2^n) \le \mathrm{vr}(D) \le \mathrm{vr}(B_\infty^n), \tag{2.1}$$

(see [B]), while direct computation shows that
$\mathrm{vr}(B_\infty^n) \le C\sqrt{n}$ and the best value of the constant
is $C = 2/\sqrt{\pi e}$.
Often, one encounters bodies whose volume ratio is almost extremal,
i.e. close to one of the sides of (2.1). The problem is whether such a body
inherits properties of the extremal bodies, the Euclidean ball or the cube.

If $\mathrm{vr}(D) \le A$, then by the Volume Ratio Theorem [ST], $D$ has a
section of dimension $k = n/2$ which is $cA^2$-isomorphic to the Euclidean
ball $B_2^k$, and this result is asymptotically sharp.

On the opposite side of the scale, if $\mathrm{vr}(D) \ge A^{-1}\sqrt{n}$,
$D$ has a section of dimension $k = c(A)\sqrt{n}$ which is
$C(A)\log n$-isomorphic to the cube [R], [V]. It is not known whether the
logarithmic term can be eliminated, but the order of $\sqrt{n}$ in the
dimension is optimal, as was noticed in [GTT]. Indeed, by an argument of
Figiel and Johnson (see [FJ], cor. 3.2), a random subspace
$E \subset \ell_\infty^n$ (and thus a dual of Gluskin's space) of dimension
at least $n/2$ satisfies that for any $F \subset E$,
$\mathrm{gl}(F) \ge c\dim(F)/\sqrt{n}$, where $\mathrm{gl}(F)$ is the
Gordon-Lewis constant of $F$, and $c$ is a suitable absolute constant. By
[GL], $\mathrm{gl}(F) \le \mathrm{unc}(F)$, where $\mathrm{unc}(F)$ is the
least unconditionality constant of a basis of $F$. Since
$\mathrm{unc}(\ell_\infty^k) = 1$ then

$$d(F, \ell_\infty^k) \ge \mathrm{unc}(F) \ge \frac{c\dim(F)}{\sqrt{n}},$$

and thus, if $F$ is 2-isomorphic to $\ell_\infty^k$ then
$\dim(F) \le c'\sqrt{n}$.

Our next result shows that $D$ has a projection of dimension proportional
to $n$ which is $cA$-isomorphic to the cube $B_\infty^k$.

Theorem 2.1 There are absolute constants $C$ and $c$ for which the
following holds. If $D$ is a convex symmetric body in $\mathbb{R}^n$ for
which $\mathrm{vr}(D) \ge A^{-1}\sqrt{n}$, then there exists a projection
$P$ of rank $k \ge cn/\log A$ such that

$$d(PD, B_\infty^k) \le CA.$$

To prove the Theorem, recall the notion of the cubic ratio. For every
ball $D \subset \mathbb{R}^n$ one defines

$$\mathrm{cr}(D) = \inf \Big(\frac{|B_\infty^n|}{|TD|}\Big)^{1/n},$$

where the infimum is over all linear invertible operators $T$ on
$\mathbb{R}^n$ such that $TD \subset B_\infty^n$.

Lemma 2.2 There are absolute constants $c$ and $C$ such that for every
integer $n$ and every convex symmetric body $D \subset \mathbb{R}^n$,

$$c\sqrt{n} \le \mathrm{vr}(D) \cdot \mathrm{cr}(D) \le C\sqrt{n}.$$
Proof of Theorem 2.1 Clearly, we can assume $n$ to be larger than
a suitable absolute constant $N$, which ensures that for every
$D \subset \mathbb{R}^n$, $\mathrm{vr}(D) \le 0.8\sqrt{n}$. Since
$\mathrm{vr}(D) \ge A^{-1}\sqrt{n}$, then by Lemma 2.2,
$\mathrm{cr}(D) \le CA$. Hence, there is some $T \in GL_n$ such that

$$TD \subset B_\infty^n \quad \text{and} \quad |TD|^{1/n} \ge c/A.$$

Recall that $c_1^n \le |\sqrt{n}B_2^n| \le c_2^n$ for some absolute
constants $c_1$, $c_2$, and thus there exists an absolute constant $c_3$
such that

$$e^n \le \frac{|TD|}{|c_3 A^{-1}\sqrt{n}B_2^n|}.$$

By a standard volumetric argument, the right-hand side is bounded by
$N(TD, c_3 A^{-1}\sqrt{n}B_2^n)$, and by Theorem 1.2 there are absolute
constants $K$ and $c_4$ for which

$$n \le \log N(TD, c_3 A^{-1}\sqrt{n}B_2^n) \le K \cdot \mathrm{vc}(TD, c_4 A^{-1})\log(CA).$$

Hence, there is a set $\sigma \subset \{1,\dots,n\}$ such that
$|\sigma| \ge n/(K\log(CA))$ and

$$c_4 A^{-1}B_\infty^\sigma \subset P_\sigma(TD) \subset B_\infty^\sigma.$$

It only remains to note that $\log(CA) \le C\log A$, because $A \ge 5/4$,
which follows from $A^{-1}\sqrt{n} \le \mathrm{vr}(D) \le 0.8\sqrt{n}$. ■

Remark. Since the volume ratio is always at least 1, one can always assume
that $A^{-1} \ge n^{-1/2}$. Therefore, the dimension of the cubic projection
in Theorem 2.1 is always bounded below by $cn/\log n$.

In a very similar way, one can prove the following.

Theorem 2.3 There are absolute constants $C$ and $c$ for which the
following holds. Let $D$ be a convex symmetric body in $\mathbb{R}^n$ for
which $\mathrm{vr}(D) \ge A^{-1}\sqrt{n}$. Then there exists a projection
$P$ of rank $k \ge cn$ such that

$$d(PD, B_\infty^k) \le CA^2.$$



3 Type and Infratype 

In this section we improve a result of M. Talagrand [T 92] which compares
the average over the $\pm$ signs to the minimum over the $\pm$ signs of
$\|\sum_{i=1}^n \pm x_i\|$.
Recall that a Banach space $X$ has (gaussian) type $p$ if there exists
some $M > 0$ such that for all $n$ and all sequences of vectors
$(x_i)_{i \le n}$,

$$\mathbb{E}\Big\|\sum_{i=1}^n g_i x_i\Big\| \le M\Big(\sum_{i=1}^n \|x_i\|^p\Big)^{1/p}. \tag{3.1}$$

The best possible constant $M$ in this inequality is denoted by $T_p(X)$.
We say that $X$ has infratype $p$ if there exists some $M > 0$ such that
for all $n$ and all sequences of vectors $(x_i)_{i \le n}$,

$$\min_{\eta_i = \pm 1}\Big\|\sum_{i=1}^n \eta_i x_i\Big\| \le M\Big(\sum_{i=1}^n \|x_i\|^p\Big)^{1/p}. \tag{3.2}$$

The best possible constant $M$ in this inequality is denoted by $I_p(X)$.
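The two quantities being compared, the average and the minimum of $\|\sum \pm x_i\|$ over all choices of signs, can be computed exactly for a handful of vectors by enumeration. The sketch below is only an illustration under stated assumptions: it averages over random signs rather than gaussians so that the enumeration is exact, vectors are plain tuples, and the helper names `sign_norms`, `l1_norm` and `linf_norm` are not from the paper.

```python
from itertools import product


def sign_norms(vectors, norm):
    """Return (average, minimum) of norm(sum_i s_i x_i) over all signs s_i = +-1."""
    dim = len(vectors[0])
    values = []
    for signs in product([-1, 1], repeat=len(vectors)):
        # Coordinate-wise signed sum of the vectors.
        total = [sum(s * v[j] for s, v in zip(signs, vectors)) for j in range(dim)]
        values.append(norm(total))
    return sum(values) / len(values), min(values)


def l1_norm(v):
    return sum(abs(c) for c in v)


def linf_norm(v):
    return max(abs(c) for c in v)


# Two copies of the same vector: the best choice of signs cancels them.
average, minimum = sign_norms([(1.0,), (1.0,)], l1_norm)
print(average, minimum)
```

For the unit vector basis of $\ell_1^n$ every choice of signs gives norm $n$, so the average and the minimum coincide; for repeated vectors the minimum can vanish while the average does not.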

In [T 92] it was shown that if $1 < p < 2$ then
$T_p(X) \le C_p I_p(X)^2$, where $C_p$ is a constant which depends only on
$p$. It is not known whether the square can be removed. Regarding the case
$p = 2$, M. Talagrand recently constructed a symmetric sequence space which
has infratype 2 but not type 2 [T 03]. Hence one cannot obtain dimension
free estimates on $T_2(X)$ in terms of $I_2(X)$. Our main result in this
section is that if $\dim(X) = n$ then
$T_2(X) \le C I_2(X) \cdot \log^{3/2} n$.

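We begin with the following fact that allows one to compare Rademacher
and Gaussian averages.

Lemma 3.1 There is an absolute constant $C$ for which the following
holds. Let $x_1,\dots,x_n$ be vectors in the unit ball of a Banach space
and let $0 < M \le \sqrt{n}$. If $0 < \lambda \le \log^{-3}(n/M^2)$ and

$$\min_{\eta_i = \pm 1}\Big\|\sum_{i \in \sigma} \eta_i x_i\Big\| \le M|\sigma|^{1/2}$$

for all $\sigma \subset \{1,\dots,n\}$ with $|\sigma| \le \lambda n$, then

$$\mathbb{E}\Big\|\sum_{i=1}^n g_i x_i\Big\| \le CM(n/\lambda)^{1/2}.$$

In the proof of Lemma 3.1 we require the following observation from
[MS], that if $\{x_1,\dots,x_n\} \subset X$ is $\varepsilon$-shattered by
$B_{X^*}$, then for any $(a_i)_{i=1}^n \in \mathbb{R}^n$,

$$\Big\|\sum_{i=1}^n a_i x_i\Big\| \ge \varepsilon\sum_{i=1}^n |a_i|. \tag{3.3}$$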
Proof of Lemma 3.1 Clearly we can assume that the given Banach
space is $X = (\mathbb{R}^n, \|\cdot\|)$ and that $(x_i)_{i \le n}$ are the
unit coordinate vectors in $\mathbb{R}^n$. Set $B = B_{X^*}$; by the
hypothesis of the lemma and (3.3), $\mathrm{vc}(B, Mv^{-1/2}) \le v$ if
$0 < v \le \lambda n$. Hence, for any $M(\lambda n)^{-1/2} \le t \le 1$,

$$\mathrm{vc}(B,t) \le (M/t)^2. \tag{3.4}$$

Set

$$E = \mathbb{E}\Big\|\sum_{i=1}^n g_i x_i\Big\| = \mathbb{E}\sup_{b \in B}\sum_{i=1}^n g_i b(i).$$

By Theorem 1.3 there are absolute constants $K$ and $c$ such that

$$E \le K\sqrt{n}\int_{cE/n}^{1} \sqrt{\mathrm{vc}(B,t) \cdot \log(2/t)}\, dt.$$

If $cE/n \le M(\lambda n)^{-1/2}$, the lemma trivially follows. Otherwise,
if the converse inequality holds, then by (3.4) and since $\lambda \le 1$,

$$E \le K\sqrt{n}\int_{cE/n}^{1} (M/t)\sqrt{\log(2/t)}\, dt \le K\sqrt{n}M \cdot \log^{3/2}(n/M^2),$$

and by the assumption on $\lambda$,

$$E \le K\sqrt{n}M \cdot \log^{3/2}(n/M^2) \le K\sqrt{n}M/\sqrt{\lambda},$$

as claimed. ■

Using Lemma 3.1 one can compare the type and infratype 2 of a Banach
space $X$.

Let $T_2^{(n)}(X)$ and $I_2^{(n)}(X)$ denote the best possible constants
$M$ in (3.1) and (3.2) respectively (with $p = 2$). So, $T_2^{(n)}(X)$ and
$I_2^{(n)}(X)$ measure the type/infratype 2 computed on $n$ vectors.
Clearly, $I_2(X) \le T_2(X)$ and

$$I_2^{(n)}(X) \le T_2^{(n)}(X) \le \sqrt{n}.$$

Theorem 3.2 Let $X$ be an $n$-dimensional Banach space. Then, for every
number $0 < \lambda \le \log^{-3}(n/I_2(X)^2)$,

$$T_2(X) \le C \cdot I_2^{(\lambda n)}(X)/\sqrt{\lambda}.$$
In particular,

$$T_2(X) \le I_2(X) \cdot C\log^{3/2}\Big(\frac{n}{I_2(X)^2}\Big) \le I_2(X) \cdot C\log^{3/2} n.$$

Proof. By [TJ] and [BKT] Theorem 3.1, the Gaussian type 2 can be
computed on $n$ vectors of norm one. Precisely, this means that $T_2(X)$ is
the smallest possible constant $M'$ for which the inequality

$$\mathbb{E}\Big\|\sum_{i=1}^n g_i x_i\Big\| \le M' n^{1/2}$$

holds for all vectors $x_1,\dots,x_n$ of norm one. Now, the assertion
follows from Lemma 3.1.

4 The shattering dimension of convex hulls 

In this section we present a sharp estimate which compares the shattering
dimensions of a class and of its convex hull. To that end, we connect
the shattering dimension to the growth rate of the expectation of the
supremum of the gaussian process $\{X_a,\ a \in P_\sigma F\}$ as a function
of $|\sigma|$.

Definition 4.1 Let $F$ be a class of functions bounded by 1 and set

$$\ell_n(F) = \sup_{(x_1,\dots,x_n) \in \Omega^n} \mathbb{E}_g \sup_{f \in F}\Big|\sum_{i=1}^n g_i f(x_i)\Big|,$$

where $g_1,\dots,g_n$ are independent, standard gaussian random variables.

Hence, $\ell_n(F)$ is the largest gaussian average associated with a
coordinate projection of $F$ on $n$ points. Since
$F \subset B(L_\infty(\Omega))$ then for every
$\sigma = (x_1,\dots,x_n)$,

$$P_\sigma F = \{(f(x_1),\dots,f(x_n)) : f \in F\} \subset B_\infty^n,$$

and the largest projection one might encounter is when
$P_\sigma F = B_\infty^n$, in which case $\ell(P_\sigma F) \sim n$. We
define a scale-sensitive parameter which measures for every
$\varepsilon > 0$ the largest cardinality of a projection which has a
"large" $\ell$-norm:

$$t(F,\varepsilon) = \sup\{n : \ell_n(F) \ge \varepsilon n\}.$$
Theorem 4.2 There are absolute constants $K$, $c$ and $c'$ such that for
any $F \subset B(L_\infty(\Omega))$ and every $\varepsilon > 0$,

$$\mathrm{vc}(F, c'\varepsilon) \le t(F,\varepsilon) \le (K/\varepsilon^2) \cdot \mathrm{vc}(F, c\varepsilon).$$

In the proof, we will use the following wording: the function $f$
associated to a set $\sigma'$ in Definition 1.1 will be called the function
that shatters $\sigma'$.

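Proof of Theorem 4.2 Assume that $\{x_1,\dots,x_n\}$ is
$\varepsilon$-shattered by $F$. For every $J \subset \{x_1,\dots,x_n\}$,
let $f_J$ be the function shattering $J$, and for each
$(\varepsilon_1,\dots,\varepsilon_n) \in \{-1,1\}^n$ set
$I = \{x_i \mid \varepsilon_i = 1\}$. By the triangle inequality and letting
$f = f_I$, $f' = f_{I^c}$ in the second inequality below,

$$\sup_{f \in F}\Big|\sum_{i=1}^n \varepsilon_i f(x_i)\Big| \ge \frac{1}{2}\sup_{f,f' \in F}\Big|\sum_{i=1}^n \varepsilon_i\big(f(x_i) - f'(x_i)\big)\Big| \ge n\varepsilon.$$

Hence,

$$\sup_{f \in F}\Big|\sum_{i=1}^n \varepsilon_i f(x_i)\Big| \ge n\varepsilon$$

for every choice of signs, and in particular this holds for the average.
The first bound is evident because of the known connections between
gaussian and Rademacher averages [TJ], namely, that there is an absolute
constant $C$ such that for any class $F$ and any set
$\sigma = (x_1,\dots,x_n)$,

$$\ell(P_\sigma F) = \mathbb{E}_g\Big\|\sum_{i=1}^n g_i e_i\Big\|_{(P_\sigma F)^\circ} \ge C \cdot \mathbb{E}_\varepsilon\Big\|\sum_{i=1}^n \varepsilon_i e_i\Big\|_{(P_\sigma F)^\circ} = C \cdot \mathbb{E}_\varepsilon \sup_{f \in F}\Big|\sum_{i=1}^n \varepsilon_i f(x_i)\Big|.$$

The reverse inequality follows from Theorem 1.3 in a similar way to the
proof of Elton's Theorem in [MV]. If
$\ell(P_\sigma F) \ge \varepsilon n$, then
$\mathbb{E}\sup_{f \in F} X_f \ge n\varepsilon$, where
$X_f = \sum_{i=1}^n g_i f(x_i)$. By Theorem 1.3,

$$n\varepsilon \le \mathbb{E}\sup_{f \in F} X_f \le K\sqrt{n}\int_{c\varepsilon}^{1} \sqrt{\mathrm{vc}(P_\sigma F, t) \cdot \log(2/t)}\, dt.$$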
Set $v(t) = c_0/(t\log^{1.1}(2/t))$, where $c_0 > 0$ is chosen so that
$\int_0^1 v(t)\, dt = 1$. Hence, there is some $c\varepsilon \le t \le 1$
such that
$K^2 \cdot \mathrm{vc}(P_\sigma F, t) \ge \varepsilon^2 n \cdot v^2(t)/\log(2/t)$,
implying that

$$\mathrm{vc}(F, c\varepsilon) \ge \mathrm{vc}(P_\sigma F, c\varepsilon) \ge \mathrm{vc}(P_\sigma F, t) \ge \frac{c'\varepsilon^2}{t^2\log^{3.2}(2/t)} \cdot n \ge c''n\varepsilon^2.$$

■

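The previous result can be used to estimate the shattering dimension
of a convex hull of a class.

Corollary 4.3 There are absolute constants $K$ and $c$ such that for any
$F \subset B(L_\infty(\Omega))$ and every $\varepsilon > 0$,

$$\mathrm{vc}(\mathrm{conv}(F),\varepsilon) \le (K/\varepsilon)^2 \cdot \mathrm{vc}(F, c\varepsilon).$$

Proof. Since the $\ell$-norm of a set and of its convex hull are the same,
then for any $\varepsilon > 0$,
$t(F,\varepsilon) = t(\mathrm{conv}(F),\varepsilon)$. By Theorem 4.2,

$$\mathrm{vc}(\mathrm{conv}(F),\varepsilon) \le t(\mathrm{conv}(F),\varepsilon) = t(F,\varepsilon) \le (K/\varepsilon)^2 \cdot \mathrm{vc}(F, c\varepsilon).$$

■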
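The key step in this proof is that a supremum of a linear functional over a set does not change when the set is replaced by its convex hull. The following sketch illustrates that fact numerically for a small finite class; it is an illustration under stated assumptions only (functions are tuples of values on three points, and `sup_abs` and `random_convex_combination` are hypothetical helper names).

```python
import random


def sup_abs(points, g):
    """sup over the given points of |<g, p>|."""
    return max(abs(sum(gi * pi for gi, pi in zip(g, p))) for p in points)


def random_convex_combination(points, rng):
    """A random point of the convex hull of the given points."""
    weights = [rng.random() for _ in points]
    total = sum(weights)
    dim = len(points[0])
    return tuple(
        sum(w * p[j] for w, p in zip(weights, points)) / total for j in range(dim)
    )


rng = random.Random(0)
F = [(1.0, 0.0, -1.0), (0.5, 0.5, 0.5), (-1.0, 1.0, 0.0)]
g = [rng.gauss(0.0, 1.0) for _ in range(3)]
hull_sample = [random_convex_combination(F, rng) for _ in range(200)]

# No sampled point of conv(F) beats the supremum over F itself.
print(sup_abs(hull_sample, g) <= sup_abs(F, g) + 1e-12)
```

Since this holds for every realization of the gaussian vector, it also holds after taking expectations, which is the equality of $\ell$-norms used above.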

Next, we show that this estimate is sharp, in the sense that the exponent
in $1/\varepsilon^2$ cannot be improved. To that end, we require some
properties of the shattering dimension of classes of linear functionals
mentioned before, which was investigated in [MS].

If $X$ is a normed space then $B_{X^*}$ can be viewed as a subset of
$L_\infty(B_X)$ in the natural way. It is not difficult to characterize the
shattering dimension in this case.

Lemma 4.4 A set $\{x_1,\dots,x_n\} \subset B_X$ is $\varepsilon$-shattered
by $B_{X^*}$ if and only if $(x_i)_{i=1}^n$ are linearly independent and
$\varepsilon$-dominate the unit-vector basis; i.e.,

$$\varepsilon\sum_{i=1}^n |a_i| \le \Big\|\sum_{i=1}^n a_i x_i\Big\| \le \sum_{i=1}^n |a_i|$$

for every $a_1,\dots,a_n \in \mathbb{R}$.

In particular, if $X$ is $n$-dimensional, and if the Banach-Mazur distance
satisfies $d(X,\ell_1^n) \le \alpha$, then
$\mathrm{vc}(B_{X^*}, B_X, 1/\alpha) = n$.
Corollary 4.5 There exists an absolute constant $k$ for which the following
holds. For every $0 < \varepsilon < 1/2$ there is a class
$F \subset B(L_\infty(\Omega))$ such that

$$\mathrm{vc}(\mathrm{conv}(F),\varepsilon) \ge \frac{k \cdot \mathrm{vc}(F,\varepsilon)}{\varepsilon^2\log(1/\varepsilon)}.$$

Proof. For every integer $n$, let $\Omega_n = B_\infty^n$ and set
$F_n = \{e_1,\dots,e_n\}$, that is, the standard unit vectors in
$\mathbb{R}^n$, when considered as linear functionals on $B_\infty^n$.
Since $|F_n| = n$, it follows that for every $\varepsilon > 0$,
$\mathrm{vc}(F_n,\Omega_n,\varepsilon) \le \log_2 n$. On the other hand,
$\mathrm{conv}(F_n) = B_1^n$ when considered as functionals on
$B_\infty^n$. By Lemma 4.4 applied to $X = \ell_\infty^n$, and since
$d(\ell_\infty^n,\ell_1^n) \le K\sqrt{n}$ [TJ], it is evident that there is
a subset of cardinality $n$ in $B_\infty^n$ which is $k/\sqrt{n}$-shattered
by $B_1^n$. Thus, for $\varepsilon_n = k/\sqrt{n}$,

$$\mathrm{vc}(\mathrm{conv}(F_n),\Omega_n,\varepsilon_n) \ge \frac{k' \cdot \mathrm{vc}(F_n,\Omega_n,\varepsilon_n)}{\varepsilon_n^2\log(1/\varepsilon_n)},$$

from which the proof easily follows.



5 Almost isometric coordinate projections 

Given a real-valued function $f$ on a probability space, its
$\psi_p$-norm ($p \ge 1$) is defined as the Orlicz norm corresponding to the
function $\exp(t^p) - 1$. Precisely, $\|f\|_{\psi_p}$ is the infimum of all
numbers $\lambda$ satisfying
$\mathbb{E}\exp(|f|^p/\lambda^p) \le e$. It is possible to compare the
$\psi_p$ norm with other $\psi_q$ norms and the $L_p$ norms. Indeed, one can
show that if $1 \le p \le q < \infty$,
$\|f\|_{\psi_p} \le C_{p,q}\|f\|_{\psi_q}$ and
$\|f\|_{L_p} \le C_p\|f\|_{\psi_1}$ (see, for example, [VW]).
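On a finite domain with the uniform measure, the infimum in this definition can be approximated by bisection, because $\lambda \mapsto \mathbb{E}\exp(|f|^p/\lambda^p)$ is decreasing. The sketch below is a minimal illustration, not part of the original text; the function name `psi_norm` and the iteration count are assumptions made for the example.

```python
import math


def psi_norm(f, p=2.0, iters=100):
    """Approximate the psi_p norm of f on {1,...,n} with the uniform measure.

    Returns (up to bisection accuracy) the smallest lam with
    mean(exp(|f_i|^p / lam^p)) <= e.
    """
    top = max(abs(v) for v in f)
    if top == 0:
        return 0.0

    def admissible(lam):
        try:
            mean = sum(math.exp((abs(v) / lam) ** p) for v in f) / len(f)
        except OverflowError:
            # The average is astronomically large, so lam is far too small.
            return False
        return mean <= math.e

    # lam = max|f| is always admissible, since every term is then at most e.
    lo, hi = 0.0, top
    for _ in range(iters):
        mid = (lo + hi) / 2
        if admissible(mid):
            hi = mid
        else:
            lo = mid
    return hi


print(psi_norm([2.0, 2.0, 2.0]))       # constant function: the norm is its absolute value
print(psi_norm([10.0] + [0.0] * 99))   # a single spike of L_2 norm one on 100 points
```

For a constant function the norm equals its absolute value, while a single spike normalized in $L_2^n$ has a $\psi_2$ norm that grows with $n$, which is the kind of vector excluded by a bound on the $\psi_2$ norm.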

A function $f$ is bounded in the $\psi_2$ norm if and only if $f$ has a
subgaussian tail. Namely, if $\|f\|_{\psi_2} \le 1$ then by Chebyshev's
inequality $\mathbb{P}\{|f| > t\} \le e^{-t^2+1}$ for all $t > 0$.
Conversely, if for some $A \ge 1$ one has
$\mathbb{P}\{|f| > t\} \le Ae^{-t^2}$ for all $t \ge 1$, then integrating by
parts it follows that $\mathbb{E}\exp((f/2)^2) \le 1 + A/3 \le 2^A$, and by
Jensen's inequality one can conclude that $\|f\|_{\psi_2} \le 2A$ (we did
not attempt here to give the right dependence on $A$).

Another simple but useful fact which follows from Jensen's inequality
is that $\|f\|_{\psi_2} \le C\,\mathbb{E}\exp(f^2)$, where $C$ is an
absolute constant.

We will focus on functions defined on a finite domain, which we identify
with $\{1,\dots,n\}$, equipped with a uniform measure, where each atom
carries a weight of $1/n$. We denote the $\psi_2$ norm of a function $f$ on
this probability space by $\|f\|_{\psi_2^n}$. Since $f$ is defined on
$\{1,\dots,n\}$, we sometimes identify $f$ with the sequence of scalars
$(f(i))_{i=1}^n$.

We shall use the following standard probabilistic model for random
coordinate projections. Given $0 < \delta < 1/2$, let
$\delta_1,\dots,\delta_n$ be selectors, i.e. independent $\{0,1\}$-valued
random variables with mean $\delta$. Then
$\sigma = \{i \mid 1 \le i \le n,\ \delta_i = 1\}$ is a random subset of
the interval $\{1,\dots,n\}$ with average cardinality $\delta n$.
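The selector model is straightforward to simulate. The following sketch is an illustration only: indices run from 0 to n-1, and the names `random_subset` and `projected_l2_norm` are assumptions introduced for the example. It draws a random subset and evaluates the $L_2$ norm of the coordinate projection with respect to the uniform probability measure on the subset.

```python
import math
import random


def random_subset(n, delta, rng):
    """Indices i in {0,...,n-1} whose selector delta_i equals 1."""
    return [i for i in range(n) if rng.random() < delta]


def projected_l2_norm(f, sigma):
    """L_2 norm of the restriction of f to sigma, uniform measure on sigma."""
    return math.sqrt(sum(f[i] ** 2 for i in sigma) / len(sigma))


rng = random.Random(1)
n, delta = 1000, 0.1
sigma = random_subset(n, delta, rng)

# A bounded function of L_2^n norm one with values of two different sizes.
f = [math.sqrt(1.5) if i % 2 == 0 else math.sqrt(0.5) for i in range(n)]
print(len(sigma), projected_l2_norm(f, sigma))
```

The cardinality of the subset is random with mean $\delta n$, and `projected_l2_norm` requires a nonempty subset.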

By Bernstein's inequality [VW], for every $0 < \varepsilon < 1$,

P{(l- £ )||/|||n < ^£ ftl/COl" ^ (l+^H/lli}} ^ i-Z^K-Jf^ 

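and by another application of Bernstein's inequality,

$$\mathbb{P}\Big\{\Big|\frac{1}{\delta n}\sum_{i=1}^n \delta_i - 1\Big| > \varepsilon\Big\} \le 2\exp(-c\varepsilon^2 n\delta), \tag{5.1}$$

implying that if $\|f\|_\infty \le 1$, then with probability at least
$1 - 4\exp(-c\varepsilon^2|\sigma|)$,

$$(1-\varepsilon)\|f\|_{L_2^n} \le \|P_\sigma f\|_{L_2^\sigma} \le (1+\varepsilon)\|f\|_{L_2^n}.$$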

In this section we relax the assumption that $f$ is bounded in the uniform
norm, and assume that $f$ is bounded in the $\psi_2$ norm.

Roughly speaking, we show that for every $1 \le p < \infty$, the set of
vectors in $S(L_p^n)$ which will be almost isometrically projected onto
$L_p^\sigma$ are those with a "small" $\psi_p^n$ norm.

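Proposition 5.1 Let $\{\delta_i\}_{i=1}^n$ be independent
$\{0,1\}$-valued random variables with mean $\delta > 0$. Set
$a = (a_i)_{i=1}^n \in \mathbb{R}^n$ and put $M = \|a\|_{\psi_1^n}$. Then,
for every positive number $t \le M/2$,

$$\mathbb{P}\Big\{\sum_{i=1}^n (\delta_i - \delta)a_i > t\delta n\Big\} \le \exp\Big(-\frac{ct^2\delta n}{M^2}\Big),$$

where $c$ is an absolute constant.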

The proof starts with the following standard lemma.

Lemma 5.2 Let $Z$ be a random variable and assume that for some
$b, \lambda > 0$,

$$\mathbb{E}\exp(\lambda Z) \le e^{b^2\lambda^2}.$$

Then

$$\mathbb{P}\{Z > 2b^2\lambda\} \le e^{-b^2\lambda^2}.$$

Proof. For $t > 0$,

$$\mathbb{P}\{Z > t\} = \mathbb{P}\{\exp(\lambda(Z - t)) > 1\} \le \mathbb{E}\exp(\lambda(Z - t)) = e^{-\lambda t}\,\mathbb{E}\exp(\lambda Z) \le e^{b^2\lambda^2 - \lambda t}.$$

Setting $t = 2b^2\lambda$ completes the proof. ■

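Proof of Proposition 5.1 By homogeneity, we can assume that $M = 1$,
and we shall evaluate
$\mathbb{E}\exp(t\sum_{i=1}^n (\delta_i - \delta)a_i)$. To that end, let
$\delta_i'$ be an independent copy of $\delta_i$ and set
$\tilde\delta_i = \delta_i - \delta_i'$. By Jensen's inequality,

$$\mathbb{E}\exp\Big(t\sum_{i=1}^n (\delta_i - \delta)a_i\Big) \le \mathbb{E}\exp\Big(t\sum_{i=1}^n (\delta_i - \delta_i')a_i\Big) = \prod_{i=1}^n \mathbb{E}\exp(t\tilde\delta_i a_i) = E.$$

Set $\tilde\delta = \delta(1 - \delta)$ and note that $\tilde\delta_i$ is 0
with probability $1 - 2\tilde\delta$, and 1 and $-1$, each with probability
$\tilde\delta$. Therefore,

$$\mathbb{E}\exp(t\tilde\delta_i a_i) = (1 - 2\tilde\delta) + \tilde\delta e^{ta_i} + \tilde\delta e^{-ta_i} = 1 + 2\tilde\delta(\cosh(ta_i) - 1).$$

Since $\cosh x \le 1 + \frac{1}{2}x^2e^{|x|}$ for all real $x$, then

$$E \le \prod_{i=1}^n \big(1 + \tilde\delta t^2a_i^2e^{t|a_i|}\big) \le \prod_{i=1}^n \exp\big(\tilde\delta t^2a_i^2e^{t|a_i|}\big) = \exp\Big(\tilde\delta t^2n \cdot \frac{1}{n}\sum_{i=1}^n a_i^2e^{t|a_i|}\Big).$$

The normalized sum is estimated by Cauchy-Schwarz and using the fact
that $2t \le 1$:

$$\frac{1}{n}\sum_{i=1}^n a_i^2e^{t|a_i|} \le \Big(\frac{1}{n}\sum_{i=1}^n a_i^4\Big)^{1/2}\Big(\frac{1}{n}\sum_{i=1}^n e^{2t|a_i|}\Big)^{1/2} \le 2\|a\|_{L_4^n}^2 \le C,$$

because $c\|a\|_{L_4^n} \le \|a\|_{\psi_1^n} \le 1$. Hence,

$$E \le \exp(C\tilde\delta t^2n) \le \exp(C'\delta t^2n).$$

We put this in a form convenient for applying Lemma 5.2:

$$\mathbb{E}\exp\Big(t\delta n \cdot \frac{1}{\delta n}\sum_{i=1}^n (\delta_i - \delta)a_i\Big) \le \exp\Big(\frac{C}{\delta n}(t\delta n)^2\Big),$$

and apply the lemma for $\lambda = t\delta n$ and $b^2 = C/(\delta n)$. It
follows that for every $t > 0$,

$$\mathbb{P}\Big\{\frac{1}{\delta n}\sum_{i=1}^n (\delta_i - \delta)a_i > 2ct\Big\} \le \exp(-c\delta t^2n),$$

which completes the proof.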



Corollary 5.3 Applying Proposition 5.1 to $-a$ it is evident that

$$\mathbb{P}\Big\{\Big|\sum_{i=1}^n (\delta_i - \delta)a_i\Big| > t\delta n\Big\} \le 2\exp\Big(-\frac{ct^2\delta n}{M^2}\Big),$$

where $M = \|a\|_{\psi_1^n}$ and $0 < t \le M/2$.



An easy application of this corollary is the fact that the $\psi_2$-norm of
points on the sphere determines the cardinality of an almost isometric
projection.

Corollary 5.4 There is an absolute constant $C$ for which the following
holds. For every integer $n$, any $f \in S(L_2^n)$ and every
$\varepsilon > 0$, a random set $\sigma \subset \{1,\dots,n\}$ of average
cardinality $(CM/\varepsilon)^2$ satisfies with probability at least 1/2
that

$$1 - \varepsilon \le \|P_\sigma f\|_{L_2^\sigma} \le 1 + \varepsilon,$$

where M - 



Proof. The proof follows immediately from Corollary 5.3 by taking
$a_i = f^2(i)$ and $\delta n = (CM/\varepsilon)^2$, and applying (5.1). ■

Note that a similar result can be easily derived for any
$1 \le p < \infty$, simply by the fact that
$\|(a_i)\|_{\psi_p^n}^p = \|(a_i^p)\|_{\psi_1^n}$.

Corollary 5.3 can be used to present a new insight into the well known
Johnson-Lindenstrauss "Flattening" Lemma [JL], which states that every
set $\{x_1,\dots,x_n\} \subset \ell_2$ can be $(1+\varepsilon)$-isometrically
embedded in $\ell_2^m$, where $m \le (C/\varepsilon)^2\log n$. One can
formulate the Johnson-Lindenstrauss Lemma as follows:

Theorem 5.5 There is an absolute constant $C$ for which the following
holds. For every $f_1,\dots,f_n \in S(L_2^n)$ and every $\varepsilon > 0$
there is an orthogonal operator $O$ and a set
$\sigma \subset \{1,\dots,n\}$ of cardinality at most
$(C/\varepsilon)^2\log n$, such that for all $1 \le i \le n$,

$$1 - \varepsilon \le \|P_\sigma Of_i\|_{L_2^\sigma} \le 1 + \varepsilon.$$
As Corollary 5.4 shows, an almost isometric coordinate projection of $f$
is possible, as long as $\|f\|_{\psi_2^n}$ is small; hence, the
$\psi_2^n$ norm defines a "good region" on the sphere for which a random
coordinate projection will be an almost isometry. In a similar way, this
can also be performed with many functions simultaneously:

Corollary 5.6 There is an absolute constant $C$ for which the following
holds. For every $f_1,\dots,f_n \in S(L_2^n)$ and every $\varepsilon > 0$ a
random set $\sigma \subset \{1,\dots,n\}$ of cardinality
$(CM/\varepsilon)^2\log n$ satisfies that with probability at least 1/2,

$$1 - \varepsilon \le \|P_\sigma f_i\|_{L_2^\sigma} \le 1 + \varepsilon, \qquad 1 \le i \le n,$$

where M = max, 

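Proof. As in Corollary 5.4 but taking
$\delta n = (CM/\varepsilon)^2\log n$, we obtain that for every
$1 \le i \le n$,

$$\Pr\{1 - \varepsilon \le \|P_\sigma f_i\|_{L_2^\sigma} \le 1 + \varepsilon\} \ge 1 - \frac{1}{2n}.$$

Then, by the union bound,

$$\Pr\{\forall\, 1 \le i \le n,\ 1 - \varepsilon \le \|P_\sigma f_i\|_{L_2^\sigma} \le 1 + \varepsilon\} \ge 1/2,$$

which completes the proof. ■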

The connection to the Johnson-Lindenstrauss Lemma is easy: with high
probability, a random orthogonal operator $O$ will map any set of $n$
vectors on the sphere to the "good region", i.e. to the region where the
$\psi_2^n$ norm is bounded by an absolute constant.
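The effect of a random rotation can be seen numerically on the worst possible vector, a single spike. The sketch below is an illustration under stated assumptions: it uses the fact that the image of a fixed point under a Haar-distributed orthogonal operator is uniformly distributed on the sphere, so a normalized gaussian vector is sampled instead of an orthogonal matrix; the names `psi2_norm` and `haar_rotated_point` are introduced here for the example.

```python
import math
import random


def psi2_norm(f, iters=200):
    """Bisection for the smallest lam with mean(exp(f_i^2 / lam^2)) <= e."""
    top = max(abs(v) for v in f)
    if top == 0:
        return 0.0
    lo, hi = 0.0, top
    for _ in range(iters):
        mid = (lo + hi) / 2
        try:
            ok = sum(math.exp((v / mid) ** 2) for v in f) / len(f) <= math.e
        except OverflowError:
            ok = False  # the average is huge, so mid is far too small
        if ok:
            hi = mid
        else:
            lo = mid
    return hi


def haar_rotated_point(n, rng):
    """A uniformly distributed point of S(L_2^n), i.e. Euclidean norm sqrt(n)."""
    g = [rng.gauss(0.0, 1.0) for _ in range(n)]
    r = math.sqrt(sum(v * v for v in g))
    return [math.sqrt(n) * v / r for v in g]


n = 100
spike = [math.sqrt(n)] + [0.0] * (n - 1)        # L_2^n norm one, all mass on one atom
rotated = haar_rotated_point(n, random.Random(0))

print(psi2_norm(spike), psi2_norm(rotated))
```

The spike maximizes the $\psi_2^n$ norm among vectors of $L_2^n$ norm one, because $\sum_i \exp(y_i^2/\lambda^2)$ is convex in $(y_i^2)$, so any rotated copy has a smaller norm; the simulation only illustrates how much smaller it typically is.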

Lemma 5.7 There is an absolute constant $C$ such that for every integer
$n$ and any $x \in S(L_2^n)$,

$$\Pr\nolimits_{O_n}\{\|Ox\|_{\psi_2^n} > C\} \le \frac{1}{2n},$$

where the probability measure is the Haar measure on the orthogonal group.
As a consequence, for every $f_1,\dots,f_n \in S(L_2^n)$,

$$\max_{i \le n}\|Of_i\|_{\psi_2^n} \le C$$

with probability greater than 1/2, and thus Theorem 5.5 is implied by
Corollary 5.6.
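Proof. Clearly, it suffices to show that there is an absolute constant $C$
such that

$$\Pr\Big\{x \in S^{n-1} : \|x\|_{\psi_2^n} > \frac{C}{\sqrt{n}}\Big\} \le \frac{1}{2n}.$$

Consider the function $g : S^{n-1} \to \mathbb{R}$ defined by
$g(x) = \|x\|_{\psi_2^n}$. To estimate its Lipschitz constant, observe that
for every $x \in S^{n-1}$, $\|x\|_{\psi_2^n} \le \sqrt{2/\log n}$. Indeed,
for $0 \le x \le 1$, $n^{x^2/2} \le nx^2 + 1$; hence,

$$\frac{1}{n}\sum_{i=1}^n \exp\Big(\frac{x_i^2\log n}{2}\Big) = \frac{1}{n}\sum_{i=1}^n n^{x_i^2/2} \le \frac{1}{n}\sum_{i=1}^n (nx_i^2 + 1) = 2 \le e.$$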

To bound the expectation of $g$ (with respect to the Haar measure on
the sphere), recall the median of the function $f(x) = \sqrt{n}|x_i|$
satisfies that $M_f \sim c$, and that $\|f\|_{\mathrm{lip}} \le 1$. Hence,
by concentration of measure on the sphere [MiS], for any $s > c$ and every
$1 \le i \le n$,

$$\Pr\{x \in S^{n-1} : \sqrt{n}|x_i| > s\} \le Ce^{-cs^2},$$
and thus, $\mathbb{E}\exp(cnx_i^2) \le 2$ for an appropriate absolute
constant $c$. Recall that there is an absolute constant $K$ such that for
every function $f$, $\|f\|_{\psi_2} \le K\,\mathbb{E}\exp(f^2)$; therefore,
for $x = (x_1,\dots,x_n)$,

$$\|\sqrt{cn}\,x\|_{\psi_2^n} \le \frac{K}{n}\sum_{i=1}^n \exp(cnx_i^2).$$

Taking the expectation with respect to $x$ on the sphere,

$$\mathbb{E}\|\sqrt{cn}\,x\|_{\psi_2^n} \le \frac{K}{n}\sum_{i=1}^n \mathbb{E}\exp(cnx_i^2) \le K'$$

for an absolute constant $K'$.

By the concentration of measure on the sphere applied to the function
$g$,

$$\Pr\{x \in S^{n-1} : \|x\|_{\psi_2^n} > \mathbb{E}g + t\} \le Ce^{-ct^2n\log n},$$

and the claim follows by selecting $t = C'/\sqrt{n}$. ■